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TITLE: SYSTEM AND METHOD FOR RECEIVING AND RENDERING MULTI- 

LINGUAL TEXT ON A SET TOP BOX 

BACKGROUND OF THE INVENTION 



1. Field of the Invention 

This invention relates to digital television systems and more particularly to the receiving and 
rendering of multi-lingual text on set top boxes of digital television systems. 

2. Description of the Related Art 



The emerging technology of digital television systems holds a promise of allowing a television set 
to provide a vast array of new services. Digital television systems are capable of displaying text and graphic 
1 5 images in addition to typical video program streams. An example of digital television services which make 
use of text and graphic image display is interactive television. Proposed features of interactive television 
accommodate a variety of marketing, entertainment and educational capabilities such as allowing a user to 
order an advertised product or service, compete against contestants in a game show, or request specialized 
information regarding a televised program. 
20 Typically the interactive functionality is controlled by a set top box connected to a television set. 

The set top box executes an interactive program written for a television broadcast. The interactive 
functionality is displayed upon the television set screen and may include icons or menus to allow a user to 
make selections via the television's remote control. 

Interactive television, and other broadcast communication systems in general which deliver textual 
25 data, often require support for multiple languages. For example, program guides are advertising tools used 
by program providers may desire to include descriptions of television programs in multiple languages, such 
as English mixed with Japanese. In addition, end users may receive data from non-native regions, such as a 
Chinese broadcast being received by a television viewer in India. 

It is highly desirable that a set top box owner be able to use the same set top box to receive textual 
30 information in more than one language. That is, it is desirable that the user not have to buy a different set top 
box to receive textual information in each different language. A language, in this context may be defined as 
a written system of representing thoughts, ideas, actions, etc. A language includes, inter alia, a grammar, 
characters, and words. 

The characters and symbols used in writing a language are commonly referred to as a "writing 
35 system", or "script." Many languages, such as Western European languages, are written with alphabetic and 
numeric characters. However Japanese, for example, is written with phonetic Hiragana and Katakana 
characters as well as alphabetic and numeric characters from Western languages and the ideographic Kanji 
characters which are largely taken from the Chinese language. The scripts of many languages may share 
common characters, as in the Western European languages. 

40 

1 
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Tlie textual information received by a set top box includes strings of characters. A "character" is an 
atomic symbol in a writing system. In alphabetic languages, this symbol consists of a single letter of the 
alphabet In ideographic languages such as Chinese and Japanese a cliaracter could be alphabetic, phonetic, 
or ideographic. 

5 A "'character set" is a group of characters used to represent a particular language or group of 

languages. A ''character encoding" is a system for numerically representing the characters of a character set 
A well-known example of a character encoding is the ASCII character encoding. Tlie numeric value 
associated with a given character in a character set is referred to as a "code point", or "encoding value." The 
set of numeric values associated with a code set is referred to as a "code set" 

10 The ASCII cliaracter encoding provides an encoding for a character set of the alpliabet numbers, 

and other characters used in the English language. The ASCII code set includes the values 0 - 127. Thus, 
each ASCII character has a unique assigned value which may be contained in 7 bits of a byte of data. For 
example, the character 'A* has a value 0x4 1 associated with it in ASCII. Many software library routines 
liave been developed to manipulate, read and write strings of ASCII characters. 

1 5 Other character encoding sets exist, which provide support for multiple languages, such as the ISO 

Latin character encoding which is used to represent many of the alphabetic languages in the world. ISO 
Latin includes a Basic Latin portion range of values (0 - 127) and an Extended Latin portion (values 128- 
255). 

Another example of a character encoding is the Japanese Industrial Standard (JIS) character 
20 encoding. JIS uses a 7-bit multi-byte encoding meclianism to represent Japanese text. 

A cliaracter encoding which enables the representation of cliaracters from many different languages 
and cliaracter sets using a single encoding scheme is referred to as a '"multi-lingual" character encoding. An 
example of a multi-lingual character encoding is the EUC (Extended UNIX Code) cliaracter encoding 
standard. EUC is typically used to represent ideographic Asian languages in tiie UNIX environment. EUC 
25 combines single byte ASCII characters with multi-byte ideographic cliaracter encodings. However, EUC 
allows only a few languages to be encoded at a time. 

Developing new software library routines to deal with strings in multiple character encodings 
and/or multiple languages may be prohibitive in terms of cost and time. Furthermore, it may be prohibitive 
in terms of storage space and/or code maintenance to support libraries to handle characters in multiple 
30 character encodings and languages. 

Some scripts combine characters to form composed characters whose shape is determined by the 
relative positions of the characters, i.e., the context of the characters. Examples of these "contextual scripts" 
are scripts for the Arabic, Hebrew, Thai, and all Indie languages. In contrast, "non-contextual scripts", such 
as the Roman alphabet used in Western languages, represent each character as a separate object of fixed 
35 shape, independent of the position in a word and of the neighboring characters. 

Each character of a character set has a unique shape which distinguishes it from other characters in 
the character set, that is, which allows a reader to distinguish the character from other characters and thus 
unambiguously convey information The shape assigned to a particular character is referred to as the 
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"glyph" of the cliaracter. The English letter 'A*, for example, has a unique glyph which makes it 
recognizable from other characters. 

Glyphs may have a particular style associated with them That is, an English k A* may be written in 
many different styles, such as in a block style or a calligraphic style. However, the style maintains the basic 
5 shape of the cliaracter such that tlie glyph is still recognizable as an * A. ' A collection of glyphs sliaring a 
common style is referred to as a "font" Examples of common fonts are Courier, Times Roman, and 
Helvetica. 

A variety of glyph representation schemes exist. A common scheme is a bitmap glyph, or font. In a 
bitmap font, the glyph of a given cliaracter includes a sequence of bits corresponding to an array of pixels on 
1 0 a display screen. Each bit indicates if the corresponding pixel is to be illuminated or not based on the value 
of the bit The pixel array has a characteristic width and height. For example, a glyph may be 24 pixels wide 
and 24 pixels Iiigh. In tliis example, 576 bits, or 72 bytes, of storage are required to store tlie glyph. 

If the glyplis in a font are tlie same number of pixels in width, the font is said to be a non- 
proportional font. If the width is variable, the font is said to be a proportional font. Another common glyph 
1 5 representation sclieme is an outline font. A property of outline fonts is that they typically facilitate scaling 
and rotating. 

A set top box receives text encoded according to a character encoding and displays the text on a 
television. The act of processing the image of a character, i.e., the glyph associated with the cliaracter, and 
displaying the character is referred to as ''rendering." A rendering program must use font type information, 
20 size infonnation, and potentially contextual information in order to properly render a given character in a 
given script. 

Transmission bandwidth in digital broadcast systems is a precious commodity. Hence, there is a 
motivation to minimize the number of bytes transmitted to the set top box with regard to the displaying of 
text. 

25 Languages wliich have a relatively large number of characters, such as Chinese, Japanese, and 

Korean, pose particular problems in the context of text processing and rendering in digital television 
systems. One problem is the large time to search through such a large set of characters to find a glyph 
associated with a given code point The combined Chinese, Japanese, and Korean character sets constitute 
over 120,000 characters. Secondly, the amount of memory required to store fonts and/or transmission 

30 bandwidth required to transmit fonts may be costly. 

In many circumstances, set top boxes are a commodity item. Hence, a multi-lingual capable set top 
box which costs significantly more than a uni-lingual set top box may not be accepted readily in the market 
place. On the other hand, the set top box must deliver performance which is acceptable at a given cost. 
Thus, the factor of cost versus performance figures in to the design of a set top box. 

3 5 Two components of a typical set top box which have a large bearing on its cost are its memory and 

processor. If multiple languages are supported, particularly if the languages have a large number of 
characters, such as Chinese, Japanese, or Korean, a large amount of memory may be required to store tlie 
fonts for tlie languages. More powerful processors provide higher performance of functions such as 
character lookup and rendering, but at a greater cost 



BNSDOCID: <WO 9821B90A1_L> 



WO 98/21890 




PCT7US97/20858 



SUMMARY OF THE INVENTION 

The problems outlined above are in large part solved by a system and method of the present 
invention for receiving and rendering multi-lingual text on a set top box of a digital television system In one 
5 embodiment, the system comprises a set lop box which is configured to receive text the characters of which 
are encoded according to a multi-lingual character encoding standard, "Unicode". The set top box is further 
configured to process the Unicode text, and render the text for display on a television coupled to the set top 
box. The set top box is configured with an operating environment which accepts language-specific glyph 
sets to be modularly ''plugged in" to the set top. One or more glyph sets can be plugged into the set top box 

1 0 to support one or more languages as desired. Glyjphs or glyph sets may be downloaded into the set top box 
along with the application program in the event that a given glyph is not present in the set top box. The set 
top box may also employ an improved hashing metliod for efficiently storing and quickly retrieving 
characters of a language with large number of characters, such as Japanese. 

Anapplication developer develops an application program, such as an interactive TV program, 

1 5 using development tools and libraries such that the textual data in the application program are Unicode 

characters. Preferably, the textual data is included in a resource file, which is separate from the instructions 
of the application program. A broadcast center mixes the application program, including tlie resource file, 
with a digital audio/video data stream. The audio/video stream includes the data for playing the television 
program or commercial to be shown on the user's televisioa Typically, the audio/video stream is 

20 compressed using a compression algorithm such as one of the Motion Picture Expert Group (MPEG) 

compression standards. The broadcast center transmits the data stream to the set top box. The data stream is 
transmitted by a suitable transmission meclianisin, such as via satellite or coaxial cable. 

The set top box receives the stream of digital data from the broadcast center. Tlie set top box de- 
multiplexes the audio/video stream portion from the application program and stores the application program 

25 in local memory of the set top box. The set top box decompresses the audio/video data stream for display on 
the television. A processor in the set top box executes the application program 

The operating environment running on the set top box is configured to manage the different tasks, 
such as the application program, which are executed by the set top box. Preferably, the operating 
environment includes amnterpreter which interprets code instructions which are processor independent. 

30 Preferably, the application program is interpreted by the interpreter. 

The interpreter includes a Unicode encoding engine which includes library functions for 
manipulating and printing Unicode character strings. The application program calls the Unicode character 
string functions to perform string manipulations such as determining Unicode string lengths, copying 
Unicode strings and connecting Unicode strings. The application program also calls string display functions 

35 of the Unicode engine. 

The interpreter further comprises a language detector. The Unicode engine invokes the language 
detector to determine a language associated with a given character of the Unicode string. The Unicode 
engine uses the language and the font set by the application program to determ ine l which of the one or more 
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glyph sets of the set top box includes the glyph for the character. Tltc interpreter further includes one or 
more rendering engines for rendering glyphs of a given language and font. 

Glyplis or glyph sets may also be downloaded to the set top box as needed. If the application is 
configured to display a glyph which is not present in the set top box, i.e., not plugged-in to tlic set top box, 
5 the glyph may be downloaded along with the application to the set top box. The Unicode engine detects a 
condition where a glyph referenced by the application is not burned in to tlie set top box, and searches a list 
of downloaded glyphs to detect the presence of the referenced glyph. If the Unicode engine detects the 
presence of the downloaded glyph, the Unicode engine invokes tlie appropriate rendering engine to render 
the downloaded glyph. 

1 0 Each rendering engine is configured to render strings of characters according to the rendering rules 

for its particular language and font For example, a rendering engine for a contextual language knows how to 
render cliaracters in a string based on the context of each character. Furthermore, a rendering engine may 
have specific knowledge about the standards of a given region, such as regarding time, date, and currency 
symbols. Furthermore, a rendering engine must know the direction in which the cliaracters are to be 

1 5 rendered. For example, an Arabic rendering engine would render tlie characters from right to left, whereas a 
French rendering engine would render the characters from left to right. 

Tlie glyph sets are preferably arranged in a manner conducive to efficient storage and retrieval of 
tlie glyphs in the glyph set, according to tlie characteristics of tlie language associated with tlie glyph set 
Glyph sets for languages with a large number of cliaracters may be stored and retrived using a hash table 

20 according to a hashing method. The hash method may yield a relatively small maximum number of 
collisions with a large percentage of the code points hashing to elements with approximately half tlie 
maximum number of collisions or less. 

Each glyph set has an associated rendering engine. Tlie Unicode engine invokes the appropriate 
rendering engine to process and render each Unicode string of the text A rendering engine renders a 

25 character by receiving a glyph associated with a Unicode character and populating a pixel map according to 
tlie glyph information. A pixel map is a string of bits indicating the state of each pixel in an array of pixels. 
For example, in the case of a bitmap glyph of a non-contextual language, rendering the glyph includes 
copying the glyph bit map to the appropriate location in memory. The pixel map may further include other 
property information, such as color. 

30 The rendering engine processes and renders characters of the string until the rendering engine 

encounters a character which does not belong to its language. If the rendering engine did not process the 
entire string, tlie Unicode engine updates the string pointer to point to tlie next character in the string which 
was not processed by the rendering engine, invokes the language detector to determine the language 
associated with that cliaracter, and invokes the appropriate rendering engine. The process continues until all 

35 the text has been rendered. 

The rendering engines pass tlie pixel maps to a graphics driver which controls the video hardware 
of the set top box. The graplucs driver provides the pixel maps to the video hardware of the set top box such 
that the text is displayed in the appropriate coordinates on the television display screen. 

5 
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The set top box multiplexes the decompressed audio/video stream with the rendered text and 
displays the audio/video information and rendered text on tltc television. 

Thus, the television system and method may advantageously provide a means for receiving and 
rendering text in multiple languages, and do so in a manner which maximizes code reusability thus 
5 minimizing development and maintenance time and cost by providing the ability to process text including 
characters in a universal character encoding. The system and method may further minimize the broadcast 
bandwidth required to receive and render multiple languages by providing pluggable language-specific 
modules. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects and advantages of the invention will become apparent upon reading the following 
detailed description and upon reference to Uie accompanying drawings in which: 

1 5 Fig. 1 is a block diagram of a television system according to the present invention; 

Fig. 2 is a block diagram of the set top box of the system of Fig. 1 ; 

Fig. 3 is a block diagram illustrating the flow of data in the system of Fig. 1 ; 

20 

Fig. 4 is a flowchart illustrating steps taken in developing and transmitting an application program 
in the system of Fig. 1 : 

Fig. 5 is a block diagram of the software modules of the set top box of Fig. 2; 

25 

Fig. 6 is a block diagram illustrating in more detail portions of the interpreter of Fig. 4; 

Fig. 7 is a flowchart illustrating steps taken in receiving and rendering multi-lingual text in the 
system of Fig. 1; 

30 

Fig. 8 is a flowchart illustrating in more detail the step of processing text in Fig. 7; 

Fig. 9 is a block diagram illustrating the inputs and outputs of a rendering engine of Fig. 5 ; 

35 Fig. 10 is a block diagram illustrating data structures used in the hashing method of the present 

invention according to the preferred embodiment; 

Fig. 1 1 is a block diagram illustrating data structures used in the hashing method of the present 
invention according to an alternate embodiment; 

6 
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Fig. 1 2 is a block diagram illustrating data structures used in the hashing method of the present 
invention according to an alternate embodiment; 



Fig. 13 is a flowcliart illustrating steps taken to efficiently store a cliaracter set in the set top box of 

5 Fig. 2; 

Fig. 1 4 is a flowchart illustrating steps taken to quickly retrieve a cliaracter from the set top box 
stored according to the method of Fig. 1 3. 

1 0 Wliile the invention is susceptible to various modifications and alternative forms, specific 

embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It 
should be understood, however, that the drawings and detailed description thereto are not intended to limit 
the invention to the particular form disclosed, but on tlie contrary, the intention is to cover all modifications, 
equivalents and alternatives falling within the spirit and scope of the present invention as defined by tlie 

1 5 appended claims. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring now to Fig. 1, a block diagram of a television system 10 according to one embodiment is 

20 shown. The system 10 comprises a broadcast center 12 which transmits a stream of digital data to a set top 
box 18, also referred to as a digital interactive decoder (DID). Preferably, the broadcast center 12 transmits 
the digital data stream to a satellite 14 which transmits the digital data stream to an antenna 16 coupled to tlie 
set top box 1 8. In one embodiment the broadcast center 1 2 transmits tlie digital data stream to the set top 
box 1 8 via a cable, such as a coaxial or fiber optic cable. Tlie set top box 1 8 receives the digital data stream 

25 from the antenna 16 or cable and displays a portion of the information from the digital data stream on a 
television 20 coupled to the set top box 18. 

Tlie set lop box 1 8 receives user input from a remote control 28. Preferably, tlie set top box 1 8 
provides a portion of the user input to a transaction server 26. For example, tlie set top box 1 8 may display a 
menu for ordering a product, such as a hammer. The user may provide input indicating the desire to 

30 purchase the hammer. The set top box 1 8 provides the purchase information to tlie transaction server 26 

which forwards the purchase information to the hammer manufacturer or distributor so that the product may 
be distributed and billed to the user. 

The digital data stream comprises an audio/video portion and an application program portion. The 
audio/video portion comprises tlie digital audio and video information for a television program or television 

3 5 commercial to be displayed on the television 20. Preferably, tlie audio/video stream is compressed using a 
common compression algorithm such as MPEG 2. 

The application program portion of tlie digital data stream comprises instructions and data to be 
executed on the set top box 1 8. Preferably, the application program is configured to display text on the 
television 20 which is coordinated with the television program or television commercial of tlie audio/video 

7 
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dala stream displayed on die television 20. For example, the application program may execute instructions 
to display a menu for ordering the hammer. The audio/video data stream portion and the application 
program portion are mixed together, preferably in die broadcast center 1 2 ? to produce the digital data stream 
transmitted to die set top box 1 8. 

Preferably, the application program includes textual information such as a menu, including strings 
of characters in multiple languages. The set top box 18 is configured to receive the application program and 
process the strings of characters of the application program and render die characters for display on the 
television 20. 

A video cassette recorder (VCR) 24 may also be coupled to the set top box 1 8. The set top box 1 8 
may control the VCR 24 to perform actions according to application programs downloaded to the set top box 
1 8. An example is the set top box 18 controlling the VCR 24 to perform automated recording. 

A computer 22 may also be coupled for communication with the set top box 1 8. The computer 22 
may also download application programs to the set top box 1 8. Furtlier, the set top box 1 8 may use 
resources of the computer 22, such as a hard disk, as permanent storage. The computer 22 may be locally 
connected, such as tlirough a serial connection, or remotely connected via a telephone line. 

Referring now to Fig. 2, a block diagram of the set top box 1 8 of Fig. 1 is showa Set top box 1 8 
comprises CPU 40 coupled to a read-only memory (ROM) 30. The ROM 30 includes instructions and data 
for executing on die CPU 40. A random access memory (RAM) 32 is coupled to the CPU. The RAM 32 is 
used for storing program variables for the program instructions contained in the ROM 30. The RAM 32 is 
also configured to store the application program received from the broadcast center 12 (of Fig. 1). A 
FLASH memory 34 is also coupled to the CPU 40 and contains program instructions for execution on the 
CPU 40 and/or cliaracter glyphs used in rendering text characters on the television 20 (of Fig. 1 ). 

The CPU 40 comprises a microprocessor, micro-controller, digital signal processor (DSP) ? or otiier 
type of software instruction processing device. The CPU 40 fetches instructions from the ROM 30, RAM 
32, and/or FLASH 34 and executes the instructions. 

The ROM 30 comprises read only memory storage elements as are well known in the art of solid 
state memory circuits. Preferably, die ROM 30 comprises read only memory storage which is programmed 
and plugged in to the set top box 18. 

The RAM 32 comprises dynamic random access memory (DRAM) or static random access 
memory (SRAM) storage elements as are well known in tlie art of solid state memory circuits. 

The FLASH memory 34 comprises writable permanent memory storage elements as are well 
known in the art of solid state memory circuits. Preferably, the FLASH 34 comprises memory storage which 
may be programmed, Le., written, during operation of the set top box 18. 

A security device 36 is also coupled to the CPU 40 for providing authentication and signature 
functionality. For example, the security device allows the enabling or disabling of application program 
downloading to die set top box 1 8. 

A communications port 42 is coupled to the CPU 40 and is configured to provide communication 
with other devices such as the computer 22 (of Fig. 1), the VCR 24 (of Fig. 1), or other devices such as a 
keyboard. A remote control port 44 is coupled to the CPU 40 and is configured to receive remote input such 
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25 



30 



as from the remote control 28 (of Fig. 1) or from a front panel on the set top box 1 8. Preferably, the remote 
port 44 comprises an infra-red receiver for receiving infra-red signals from the remote control 28. A modem 
46 is coupled to the CPU 40 and is configured to provide communication between the set top box 1 8 and the 
transaction server 26 (of Fig. 1). 

A demultiplexer 38 is coupled to the RAM 32 and is configured to receive the digital data stream 
from a receiver 50 coupled to the demultiplexer 38. The receiver 50 receives the digital data stream from 
the broadcast center and communicates the digital data stream to the demultiplexer 38. Hie demultiplexer 
38 is configured to demultiplex the application program from the audioA'ideo data stream of the digital data 
stream received from the broadcast center 1 2 and store the application program in the RAM 32. 

The CPU 40 executes the application program stored in the RAM memory and provides rendered 
textual data from the application program to a video encoder/multiplexer 48. The video encoder/multiplexer 
48 multiplexes the rendered text with the audio/video stream and provides the multiplexed rendered text and 
audio/video stream to the television 20 for display on the television. 

Referring now to Fig. 3, a block diagram illustrating the flow of data in the system 10 of Fig. 1 is 
shown. The broadcast center 12 (of Fig. 1 ) receives an audio/video stream 66 and an application program 
64, multiplexes the audio/video stream 66 and the application program 64, and transmits the multiplexed 
data stream to the set top box 1 8. Preferably, the language specific textual information of the application 
program 64 is included in a resource file 62 which is transmitted as part of the application 64 to the set top 
box 18. By separating the language specific portion of the application 64 in the resource file 62 the 
application 64 advantageously provides a means whereby changing the textual information from one 
language to another requires only modification to the resource file 62 rather than to tlie application 64. 

The set top box 1 8 receives the digital data stream and the demultiplexer 38 of the set top box 1 8 
demultiplexes the application 64 from the audio/video stream 66a. The audio/video stream 66 may have 
been compressed according to a lossy compression algorithm, hence the decompressed audio/video stream 
66a may be in some manner different from the initially transmitted audio/video stream 66. 

The application program 64 is executed by an operating environment 70 of the set top box 1 8. The 
application program 64 executes within the operating environment 70 to display rendered textual 
information which is received by the video multiplexer 48 (of Fig. 2) along with the audio/video stream 66a. 
Video multiplexer 48 multiplexes the rendered text information in the audio/video stream 66a and provides 
the multiplexed information to the television 20 for display on the television 20. 

Referring now to Fig. 4, a flowchart illustrating steps taken in developing and transmitting an 
application program in the system of Fig. 1 is showa An application program developer develops an 
application such as the application 64 of Fig. 3 in step 102. Preferably, the application program developer 
does not include the textual information to be displayed on the television 20 (of Fig. 1) in the application 
program, but rather places the textual information in a resource file such as the resource file 62 of Fig. 3, and 
includes in the application program 64 references to the textual information. 

The application programmer creates a resource file 62 in step 104. The resource file 62 includes 
formatted chunks of data which may be attached to the application program 64 to avoid embedding tlie data 
directly into tlie application program 64. The resource file 62 advantageously simplifies maintenance and 
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modification of the application program 64 since the data may be changed in the resource file 62 without 
modification to the tested and debugged application program 64. 

Alternatively, the textual information may be contained in the application program 64 itself. 
Preferably, the textual information comprises strings of characters wherein the cliaractcrs arc from the 
5 Unicode cliaracler set. Preferably, the application developer creates the application 64 and/or resource file 
62 using a Unicode-capable text editor, or some other suitable Unicode-capable tool. 

Unicode is a multi-lingual character encoding wluch attempts to include all current written scripts 
for all current languages. Each character in the Unicode character set is represented by a 16-bit value or 
code point, thus allowing a character set of 65,536 characters. Unicode is part of the ISO 10646 standard. 
1 0 ISO/IEC 1 0646- 1 : ] 993(E) defines the Unicode standard and is hereby incorporated by reference. 
C:' Preferably, creating the resource file in step 104 comprises optionally including one or more glyphs 

f| for particular characters referenced by the application program to be displayed on the television 20. UHie 
jjj gt^pfr ©fca particular character is not already part of a glyph set of the set top box 1 8, the glyph may^be 
| downloaded to the set top box 1 8 with the application program 64. 

its Thus, the resource file 62 of the system advantageously provides a means for providing glyphs to 

; the set top box 18. In particular, the means of downloading a glyph to the set top box 1 8 is advantageous for 
| rendering characters which are infrequently used^rspeciai eharacters^thus. allowing the saving of memory 
storage within the set top box and potentially reducing the cost of the set top box 1 8. Furthermore, it may be 
desired to render a character of a language which is not present in the set top box 1 8. In tliat situation, the 
20 glyph for the Unicode character from the language not present in the set top box 1 8 may be downloaded with 
the application program to the set top box 18. 

Once the application program and resource file are developed they are provided to the broadcast 
center 12 which transmits the application to the set top box in step 106. 

Referring now to Fig. 5, a block diagram of the software modules of the set top 1 8 box of Fig. 2 are 
25 shown. The application program 64 (of Fig. 3) communicates with tlte operating environment 70 (of Fig. 3) 
which in turn communicates with the set top box hardware 1 8 to display textual information of the 
application program 64 on the television 20. The operating environment 70 comprises device drivers 76 for 
communicating with and controlling the set top box hardware 1 8. A microkernel 72 provides system 
services to the various components of the operating environment 70 such as memory management, task 
30 management, and communication between tasks, such as the application program 64, and the device drivers 
76. 

Preferably, the application program 64 comprises instructions which may be interpreted by an 
interpreter 74. In one embodiment, the interpreted instructions in the application program are referred to as 
o-code and the interpreter 74 is an o-code interpreter, o-code comprises a stack based instruction set. 
35 Preferably, the interpreter 74, the microkernel 72, and the device driver 76 of the operating environment 70 
reside in the ROM 30 (of Fig. 2) of the set top box 1 8. Advantageously, interpreter 74 interpreting tlie o- 
code of the application 64 provides a means of developing applications 64 which are independent of the 
underlying CPU 40 (of Fig. 2) hardware. 
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Tlic interprclcr 74 includes function libraries which are accessible by ihc application program 64 
for performing functions such as allocating memory, manipulating memory, and providing user interface 





management. 



Referring now to Fig. 6, a block diagram illustrating in more detail portions of the interpreter 74 of 



5 Fig. 4 are shown. An application program 64 executing on the set top box 1 8 communicates with the 

interpreter 74 which in turn communicates with a graphics driver 76a of the device drivers 76 (of Fig. 5) to 
process and render Unicode text for display on television 20 (of Fig. 1). The interpreter 74 comprises a 
Unicode encoding engine 84. 



10 perform numerous string manipulation functions such as detennining the length of a Unicode string, copying 
a Unicode string from one location to another, concatenating two Unicode strings together, comparing two 
Unicode strings to determine if the Unicode strings are identical, and searching a Unicode string for an 
occurrence of a particular Unicode cliaracter within the Unicode string. Preferably, a Unicode string is 
defined as one or more Unicode characters terminated by a null Unicode character. 

1 5 The Unicode encoding engine 84 further comprises functions which the application program 64 

invokes to set the current font of the Unicode text to be rendered. The Unicode encoding engine 84 further 
comprises functions which the application 64 invokes for displaying Unicode text on the television 20. 

When the application program 64 invokes a function of the Unicode encoding engine 84 to display 
Unicode text, the Unicode encoding engine 84 invokes a language detector 82 in order to determine a 

20 language associated with cliaracters of the U nicode text received from the application program 64 . The 

language detector 82 informs the Unicode encoding engine 84 of the language associated with tlie Unicode 
character passed to tlie language detector 82 by the Unicode encoding engine 84. 

The interpreter 74 comprises one or more glyph sets 94a-94f, referred to collectively as 94. The 
Unicode encoding engine 84 uses the language information returned by the language detector 82 along with 

25 font information set by the application program 64 to determine which one of the glyph sets 94 includes a 
glyph for describing the particular Unicode cliaracter to be rendered. 



30 information from tlie glyph sets 94 in order to render Unicode characters. The Unicode encoding engine 84 
invokes the appropriate rendering engine from the rendering engines 92 configured to render the particular 
glyph from one of the glyph sets 94 corresponding to the given Unicode character to be rendered. 

As shown, a given rendering engine 92 may be configured to render glyphs from a plurality of 
glyph sets 94. For example, a rendering engine 92 which renders bitmap glyphs of a fixed pixel height and 

3 5 pixel width for non-contextual languages which render cliaracters from left to right may render characters for 
most of the Western European languages. 

The Unicode encoding engine 84 is further configured to determine the absence of a glyph in the 
glyph sets 94 for a given cliaracter in a language and detect the presence of a downloaded glyph 97 
corresponding to a given Unicode cliaracter to be rendered. The downloaded glyphs 97 are downloaded to 



The Unicode encoding engine 84 provides functions which the application program 64 invokes to 



Hie interpreter 74 further comprises one or more rendering engines 92a-92n, referred to 
collectively as 92. Each of the rendering engines 92 is configured to render Unicode cliaracters 
corresponding to a particular language and/or font As shown, the rendering engines 92 receive glyph 
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the set top box 1 8, preferably in the resource file 62 (of Fig. 3) along with the application program 64. The 
downloaded glyphs 97 are placed in the RAM 32 (of Fig. 2) in a list for access by the Unicode encoding 
engine 84 and rendering engines 92 in rendering Unicode cliaracters not present in the glyph sets 94. 

Tlic interpreter 74 may contain as few as one glyph set 94 and one rendering engine 92. 
Advantageously, glyph sets 94 may be modularly added to the interpreter 74 as required to accommodate 
various languages and fonts. 

For example, the set top box 1 8 (of Fig. 2) may be configured with three gljph sets: a glyph set for 
a 1 6 point "Courier" English font, a 24 point Courier English font glyph set, and 24 point Japanese font 
glyph set The set top box 1 8 is further configured with a rendering engine for rendering each of the three 
glyph sets. 

Preferably, configuring the set top box 1 8 in the manner described with modular glyph set modules 
comprises compiling and linking together the various portions of the operating environment 70 along with 
the desired glyph sets and programming the operating environment into the ROM 30 (of Fig. 2). Thus, the 
set top box 1 8 may be tailored specifically to support the desired languages in a given geographic locale. 
This localization advantageously enables the same set top box hardware 18 and large portions of the 
operating environment 70 to be reused without modification. Thus, development time and resources are 
decreased and the cost of the set top box liardware is reclaimed. 

Once the rendering engines 92 have rendered the Unicode characters and produced strings of bits 
in tlie RAM memory 32 of the set top box 1 8 (of Fig. 2), the rendering engines 92 invoke the graphics driver 
76a to display the rendered text on the television 20. The graphics driver 76a interacts with the video 
liardware of the set top box 1 8 to display the rendered text along with the audio/video data of the television 
program or commercial on the television 20. 

Hie interpreter 74 further comprises an 8-bit encoding engine 86 for handling 8-bit character 
encoding strings such as ASCII text The application program 64 invokes string functions of the 8-bit 
encoding engine 86 in order to manipulate and display 8-bit character encoding characters on the television 
20. Thus, the 8-bit encoding engine 86 performs functions similar to the Unicode coding engine 84 but for 
8-bit encoded character strings rather tlian for Unicode encoded character strings. 

Each of the rendering engines 92 is configured to render Unicode text according to rendering rules 
for the particular language associated with each of the rendering engines 92. For example, a rendering 
engine associated with a language which is a contextual language has knowledge about how to render 
characters of a string based on the context of the given character. For example, an Arabic rendering engine 
contains knowledge about particular kerns or ligatures used in connecting Arabic characters based on 
neighboring characters. 

Furthermore, a rendering engine has specific knowledge regarding the direction in which characters 
are rendered. For example, a Hebrew rendering engine renders characters from right to left whereas a 
French rendering engine renders characters from left to right Furthermore, rendering engines have specific 
knowledge about standards of a given locale such as standards for displaying times, dates and currency 
symbols, for example. 
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A glyph set from one of the glyph sets 94 comprises a plurality of glyphs organized in a manner 
optimized for the particular language or set of glyphs in the glyph set. The glyph sets are organized to 
optimize the time required for look-up of a given Unicode cliaractcr, as well as to optimize the amount of 
storage required in order to store the glyph set. Thus, two different glyph sets may be organized in two 
5 different manners. 

For example, a glyph set comprising a relatively small number of glyphs may be arranged as a 
simple indexed array A glyph set comprising a relatively large number of characters such as a Japanese, 
Chinese, or Korean glyph set may be arranged in a more sophisticated manner such as by using a hash table. 
Furthermore, glyph sets for contextual languages may include multiple tables according to context. The 
1 0 glyph sets inay also be arranged according to glyph representation such as bit-mapped glyphs, outline glyphs, 
stroke glyphs, etc. 

Referring now to Fig. 7, a flowcliart illustrating steps taken in receiving and rendering multi-lingual 
text according to llie system of Fig. 1 is shown. The set top box 1 8 (of Fig. 1 ) receives a digital data stream 
including an application program and audio/video information from the broadcast center 12 (of Fig. 1 ) in 
1 5 step 110. The demultiplexer 38 (of Fig. 2) demultiplexes the application program 64 (of Fig. 3) from the 
audio/video stream 66 (of Fig. 3) and stores the application program 64 in the RAM memory 32 (of Fig. 2) 
instep 112. 

The operating environment 70 (of Fig. 3) determines if a resource file 62 (of Fig. 3) is present with 

tlie application 64, and if so, determines if the resource file 62 includes glyphs for rendering Unicode 
20 characters in step 113. If so, the operating environment 70 places the downloaded glyphs into a list of 

downloaded glyphs 97 (of Fig. 6) for future use by one of the rendering engines 92n (of Fig. 6) in step 115. 
The operating environment 70 executes the application program 64 on the CPU 40 (of Fig. 2) in 

step 114. The application program 64 calls functions of the operating environment 70 to manipulate and 

display text on the television 20 (of Fig. 3). Preferably, the text is encoded according to the Unicode 
25 character encoding. Preferably, the Unicode text is contained within the resource file 62 (of Fig. 3) of the 

application program 64. Preferably, the application program 64 comprises references to the Unicode text 

contained in the resource file 62. 

The Unicode encoding engine 84 (of Fig. 6) receives the Unicode text from the apphcation 

program 64 in step 116. The Unicode text received by the Unicode encoding engine 84 comprises one or 
30 more Unicode text strings. Preferably, the Unicode encoding engine 84 receives the Unicode text from the 

resource file 62. The Unicode encoding engine 84 in conjunction with other portions of the interpreter 74, 

processes the Unicode text for displaying the Unicode text in step 1 1 8. 

Referring now to Fig. 8, a flowchart illustrating in more detail step 118(ofFig.7) of processing the 

Unicode text is showa In processing the Unicode text received from the application program 64, the 
35 Unicode encoding engine 84 determines whether more Unicode strings exist in the Unicode text received in 

step 120. If no more Unicode text strings exist, the text lias been processed. If more strings exist, the 

Unicode encoding engine 84 sets a current character variable to reference the first character in the current 

string to be processed in step 122. 
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The Unicode encoding engine 84 then determines whether or not more characters exist in the 
current Unicode string in step 1 24. If no more Unicode cliaracters exist in the current string, the Unicode 
encoding engine 84 returns to step 120 to determine if any more strings exist in the text If more characters 
exist in the current string as determined in step 124 ; the Unicode encoding engine 84 invokes the language 
detector 82 (of Fig. 6) to determine the language of the current cliaracter in step 1 26. The Unicode encoding 
engine 84 invokes tlic appropriate one of the rendering engines 92 (of Fig. 6) associated with the language of 
the current character in step 128. 

The Unicode encoding engine 84 invokes the rendering engine by passing a reference to the current 
string to tlie rendering engine. The rendering engine renders characters in Hie string as long as each 
cliaracter encountered is a character in the language associated with the rendering engine in step 1 30. 

Once the rendering engine detects a character not in the language associated with the rendering 
engine, tlie rendering engine stops rendering characters of the Unicode string and returns to the Unicode 
encoding engine 84 information regarding which portion of tlie Unicode siring was rendered by tlie 
rendering engine. In step 132 tlie Unicode encoding engine 84 uses tlie information returned by the 
rendering engine concerning which characters of the string were rendered by the rendering engine to assign 
the current cliaracter variable to reference the character after tlie last character rendered by the rendering 
engine. The Unicode encoding engine then returns to step 124 to determine if more characters exist in tlie 
current string to be rendered. 

Thus, each character in the Unicode text received is rendered for display on the television 20 
according to the steps of the flowchart of Fig. 8. The steps advantageously enable the set top box 18 to 
process and render Unicode text comprising cliaracters of different languages. 

Referring now to Fig. 9, a block diagram illustrating the operation of a rendering engine such as tlie 
rendering engines 92 (of Fig. 6) is shown. A rendering engine 92a representative of the rendering engines 
92 receives a Unicode cliaracter 140 and glyph set information 94a representative of tlie glyph sets 94 (of 
Fig. 6) and generates a pixel map of the rendered character 95. The rendering engine 92a receives the code 
point of the Unicode character 140 and uses tlie code point of the Unicode character 140 to access a 
corresponding glyph in tlie glyph set 94a which describes the Unicode character 140. 

The pixel map 95 comprises a string of bits indicating tlie state of each pixel in an array of pixels 
such as the pixels of a television screen. The state of a pixel is either on or off. A reference to the pixel map 
95 is passed to the graphics driver 76a (of Fig. 6) and the graphics driver 76a uses the pixel map to display 
the rendered character on the television 20. 

The rendering engine 92a takes the description of the Unicode character 140 from the glyph of the 
glyph set 94a representing the Unicode character 1 40 and generates pixels in the pixel map 95 for display of 
the Unicode character 140. In the case of a bitmap glyph, rendering the glyph typically comprises copying 
the string of bits comprising the glyph to tlie pixel map. In particular, this is true in the case of a non- 
contextual language. 

In the case of a contextual language, the rendering engine 92a modifies the bitmap contained in the 
glyph to modify or create ligatures or kerns of the glyph based on the context, i.e., the neighboring characters 
in the string, to produce a modified glyph in the form of the pixel map 95. 
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In the case of outline glyphs, the rendering engine 92a uses the outline information to generate the 
pixel map 95. The rendering engine 92a uses the outline information from the glyph along with orientation 
and sizing information to render tlie character and produce the pixel map 95. The pixel map 95 may be 
further modified by other portions of the operating environment 70 to include other properties such as color 
information in the pixel map 95. 

Referring now to Fig. 1 0, a block diagram illustrating data structures used in the hashing method 
according to the preferred embodiment is shown. The Unicode encoding engine 84 (of Fig. 6) comprises 
one or more font objects. Font object 1 50 is a representative font object. The font object 1 50 is an object 
according to the notion of objects in object-oriented programming. The font object 1 50 comprises methods 
and data associated with the object. 

The font object 150 comprises a reference to a rendering engine 92a of the rendering engines 92 (f 
Fig. 6)which is a method of the font object 1 50. The Unicode encoding engine 84 uses the language 
information from tlie language detector 82 (of Fig. 6) to determine which font object is associated with the 
language and current font of the application 64 (of Fig. 3) in order to invoke the rendering engine 92a of the 
font object 150. 

The font object 1 50 norther comprises a reference to an arrangement of a glyph set, such as glyph 
set 94a of Fig. 6 which is data of the font object 150. Fig. 10 illustrates the arrangement of a glyph set using 
a hash table 1 60 and the font object 1 50 includes a reference to the hash table 160. The glyph set 
arrangement illustrated in Fig. 10 is particularly useful for efficiently storing and quickly retrieving Unicode 
characters for languages with a large number of characters, such as Japanese, Chinese, or Korean. 

The rendering engine 92a uses the glyphs 180a-180n of the glyph set 94a to render Unicode 
characters whose glyphs are present in the glyph set 94a. 

The hash table 160 includes an array of hash table elements 162a-162n, referred to collectively as 
162. Hash table entry 162a will be referred to as a representative hash table entry. The hash table 160 is 
indexed according to indexes calculated by a hashing method upon Unicode code points described infra. 
The rendering engine 92a calculates a hash table index and uses the index to calculate the appropriate hash 
table entry. 

Each hash table entry 162a comprises a hash bin list reference 164a-164n, referred to collectively 
as 164, and a hash bin count 166a-166n, referred to collectively as 166. Hash bin list reference 164a will be 
referred to as a representative hash bin list reference. Hash bin count 1 66a will be referred to as a 
representative hash bin count 

Each hash bin list reference 164a references a list of hash bins from the array of hash bins 1 72a- 
172n, referred to collectively as 172. Hash bin 172a will be referred to as a representative hash bin. The 
hash bin count 1 66a indicates the number of hash bins in the list of hash bins referenced by its associated 
hash bin list reference 1 64a. Fig. 1 0 shows example hash bin counts, e.g., hash bin count 1 66a is 4 and 
associated hash bin list reference 164a references a hash bin list comprising hash bins 1 72a- 1 72d, the first of 
which is hash bin 1 72a. The hash bin count 1 66a enables the rendering engine 92a to search a hash bin list 
and determine when the end of the hash bin list has been reached. 
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Each hash bin 172a comprises an encoding value 174a, which is representative of encoding values 
174a-174ru and a glyph reference 176a, which is representative of glyph references 176a- 176a The 
encoding value 1 74a is the code point for a Unicode ciiaracter. Tlie glyph reference 1 76a refers to a 
representative glyph 180a, in the glyph set 94a, describing the Unicode character whose code point is in the 
encoding value 174a field. Tlie rendering engine 92a uses Hie glyph 180a referenced by the glyph reference 
1 76a to render the Unicode character whose code point is in the encoding value 1 74a field. 

The operation of the font object 150, i.e., the use of the hash table 160 by the rendering engine 92a, 
will be described in more detail in the discussion of Fig. 1 4 and the creation of the liash table 160 will be 
described in more detail in Hie discussion of Fig. 13 

Referring now to Fig. 1 1 , a block diagram illustrating data structures used in tlie hashing method 
according to an alternate embodiment is showa The embodiment of Fig. 1 1 is similar to that of Fig. 10 and 
corresponding elements are numbered identically for simplicity and clarity. The structure of the embodiment 
of Fig. 1 1 is similar to that of Fig. 1 0 except that the glyph 1 80a for each Unicode character is included in me 
hash bin 1 72a associated with the Unicode character rather than the hash bin 1 72a having a reference to the 
glyph 1 80a. That is, the glyph set 94a is distributed among the hash bins 172. 

The embodiment of Fig. 1 1 has the advantage of using less memory storage space due to the 
absence of the glyph reference 176a of Fig. 10, but the potential disadvantage of having the glyph set 
distributed among the hash bins 172. 

Referring now to Fig. 12, a block diagram illustrating data structures used in the hashing method 
according to an alternate embodiment is shown. The embodiment of Fig. 12 is similar to that of Fig. 1 1 and 
corresponding elements are numbered identically for simplicity and clarity. The structure of the embodiment 
of Fig. 1 2 is similar to that of Fig. 1 1 except that the liash bin lists comprise linked lists of hash bins, rather 
than sequentially arranged lists in an array of hash bins according to Fig, 1 1 . 

Each hash bin 1 72a further comprises a next bin 1 90a field used to create the linked list of hash 
bins. The next bin 1 90a field refers to the next hash bin in the hash bin list or contains a NULL value 
indicating the end ofthe hash bin list The hash bin count 166a field (of Fig. 11) is absent from the hash 
table elements of Fig. 12 since the end of a hash bin list may be determined by the presence of a NULL value 
in the next bin 1 90a field of a hash bin 172a 

The embodiment of Fig. 12 has the advantage of being created using a simpler creation method, but 
the disadvantage of using more memory storage space due to the presence ofthe next bin reference. 

The hashing method solves the problem of mapping a relatively large set of potential input values, 
i.e., the entire Unicode code set, to a relatively smaller subset of values, i.e., the code set associated with the 
subset of Unicode characters used in the Japanese language, or other language with a relatively large number 
of characters, such as Chinese or Korean. 

One solution contemplated is to provide an array of glyphs indexed by the code point ofthe 
Unicode character, wherein the size of the array is the size ofthe Unicode code set, i.e., 65,536 array 
elements. However, this solution is very costly in terms of memory storage space. 
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Another solution contemplated is to provide an array of encoding value/glyph pairs, the size of 
which is the size of the language-specific cliaracter subset, which is linearly searched for a matching 
encoding value. However, this solution is costly in terms of time. 

The code points for the characters of a given language are not allocated sequentially in the Unicode 
code set. For example, the cliaracters which are used in the Japanese language do not occupy a single range 
of code points in the Unicode code set If the code points were arranged in a sequential range, a simple array 
of glyphs, wherein the size of the array is the size of the language-specific character subset, indexed by 
subtracting from the code point of the cliaracter sought the smallest code point in the subset would suffice. 
However, since the code points are not sequential, this solution is not realizable. 

Another solution contemplated is to provide a binary tree, or other tree configuration for arranging 
the glyph set. This solution is potentially superior to the encoding value/glyph pair solution in terms of time, 
and potentially superior to the array of glyphs indexed by Unicode code points solution in terms of memory 
storage space. 

The hash tables, however, provides an improved method over the previously contemplated methods 
for efficiently storing and quickly retrieving Unicode character glyphs as will be discussed with reference to 
Figs. 13 and 14. 

The number of hash bins 1 72 and the number of glyphs 1 80a in the glyph set 94a is equal to the 
number of Unicode cliaracters whose glyphs are present in the glyph set 94a. Preferably, the Japanese 
characters of the Unicode character set are stored in the hash table 1 60. Preferably, the number of hash table 
elements is 2048. 

Referring now to Fig. 1 3, a flowchart illustrating steps taken to efficiently store a character subset 
of the Unicode character set in the hash tables of Figs. 10-12 of tlie set top box 1 8 (of Fig. 2) is shown. 
Preferably, tlie steps of tlie method Fig. 1 3 are performed by a computer program, referred to as the table 
generator, which generates a source code file, such as a C language source code file, which includes the data 
structures described in Figs. 10-12. The source code files are used to compile and link the operating 
environment 70 for programming the ROM 30 (of Fig. 2). Alternatively, the steps may be performed by a 
human to generate the source code files, i.e., the table generator may be a human and/or modification of the 
computer program-generated source code file. For example, the characters in a given hash bin list may be 
reordered such tliat more frequently accessed characters are placed nearer the front of the list to reduce 
lookup time. 

The table generator receives a subset of Unicode cliaracters in step 200. The character subset 
comprises an encoding value and glyph for each of the characters in the subset Preferably, the subset is the 
Unicode characters used in the Japanese language. 

The table generator allocates storage for the hash table 160 (of Figs. 10-12) in step 202. 
Preferably, allocating storage comprises generating source code data structures for programming into the 
ROM 30 (of Fig. 2). Allocating storage for the hash table comprises allocating storage for the array of hash 
table elements 162 (of Figs. 10-12). Allocating storage for a hash table element comprises allocating 
storage for a hash bin list reference, such as hash bin list reference 164a (of Fig. 10). In the case of Figs. 10 
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and 1 1 , allocating storage for a hash table element further comprises allocating storage for a liash bin count, 
such as hash bin count 166a (of Fig. 10). 

The table generator determines if more cliaracters of the subset need to be stored in step 204. If 
noL all tlie characters of the subset liave been stored in tlie hash table. 
5 If a new cliaracter is to be stored, the table generator allocates storage for a new hash bin associated 

with the new cliaracter in step 206. Allocating storage for a hash bin comprises allocating storage for an 
encoding value 1 74a, and a glyph 1 80a. In the case of Fig. 1 0, allocating storage for a new hash bin further 
comprises allocating storage for a glyph reference 1 76a. In the case of Fig. 1 2, allocating storage for a new 
hash bin further comprises allocating storage for a next bin reference 1 90a. 
1 0 Tlie table generator stores the encoding value and glyph of the new cliaracter in the newly allocated 

hash bin in step 208. In the case of Fig. 10, the table generator also populates the glyph reference, such as 
glyph reference 1 76a (of Fig. 10) with a reference to the glyph. 

The table generator calculates an index into the liash table 160 in step 210 according to the 
following equation: 

1 5 index = ((((encoding jvalue & MASK1 ) » SHIFTVAL) A encoding_value) & MASK2). 

Preferably the constant MASK1 lias a value of Oxff, die constant MASK2 has a value of 0x7ff, and 
the constant SHIFTVAL lias a value of 8. The encoding value is the encoding value of the new character to 
be stored in tlie hash table. The operation is a bitwise logical AND operation, the <fcA " operation is a 
bitwise logical EXCLUSIVE OR operation, and the "»" operation is a bitwise logical SHIFT RIGHT 
20 operation by the number of bits specified by tlie SHIFTVAL constant. 

Preferably, the number of hash table elements is 2048. Using the preferred hash table size, and 
hashing index equation with the preferred constants to store the Unicode characters used in the Japanese 
language, advantageously yields a hash table in which only 44 of the 2048 hash table entries are empty, the 
average hash bin list length, i.e., the average number of ^collisions", is 3, the maximum hash bin list length 
25 is 8, and 80% of tlie characters hash to a hash bin list of length of 5 or less. Thus, the haslung method 

provides an efficient method for storing and a quick method for retrieving Japanese Unicode characters. Hie 
present method yields a distribution of hash bin list lengths as shown in Table I. 
Table I 

Hash bin list length number of lists of this length 

0 44 

1 171 

2 404 

3 517 

4 456 

5 264 

6 148 

7 35 

8 9 
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In step 2 1 2 the table generator calculates a reference to a current hash bin list by indexing into the 
hash table using die index which was calculated in step 2 1 0. The current hash bin list reference is the liash 
bin list reference of the indexed liash table element. The table generator adds the new hash bin which was 
allocated in step 206 to the current hash bin list in step 214. 
5 With reference to Figs. 10 and 11, adding the new hash bin to the hash bin list includes 

incrementing the hash bin list count and determining if the hash bin list is empty. If the hash bin list is empty, 
the hash bin list reference is assigned a reference to the new hash bin 

With reference to Fig. 1 2, adding the new liash bin to Utc liash bin list includes assigning the "next" 
liash bin reference of the new liash bin a terminating value, preferably NULL, and determining if the hash bin 

1 0 list is empty. If the liash bin list is empty, the hash bin list reference is assigned a reference to the new liash 
bin If the liash bin list is not empty, tlie tail of the liash bin list is found and the "next" liash bin reference of 
the tail hash bin is assigned a reference to the new liash bin. 

After adding the new hash bin to the hash bin list, tlie table generator returns to step 204 to 
determine if more cliaractcrs of the subset need to be stored. 

1 5 Referring now to Fig. 14, a flowchart illustrating steps taken to quickly retrieve a Unicode character 

glyph from tlie hash tables of Figs. 10-12 of the set top box of Fig. 2 is shown. The steps to retrieve a 
Unicode character glyph from tlie hash tables are performed by a rendering engine, such as rendering engine 
92a (of Fig. 6). The rendering engine 92a retrieves the Unicode cliaracter glyph to use in rendering tlie 
specified Unicode cliaracter in response to a request from the Unicode encoding engine 84 (of Fig. 6) to 

20 render a Unicode string. Tlie rendering engine 92a receives an encoding value, or code point, corresponding 
to a Unicode cliaracter to be rendered in step 220. 

Tlie rendering engine 92a calculates an index into the liash table 160 in step 222 according to the 
same equation used in step 210 (of Fig. 13) to calculate tlie index. Tlie description of Fig. 13 provides a 
detailed description of tlie hasliing equation. The description of Fig. 13 also discusses the various list length 

25 results, shown in Table I, of the hashing method, which are pertinent to the retrieval lime associated with the 
method. 

In step 224 the rendering engine 92a calculates a reference to a hash bin list by indexing into tlie 
hash table using the index which was calculated in step 222. Tlie hash bin list reference is the hash bin list 
reference of the indexed liash table element The rendering engine 92a assigns a current liash bin reference 
to the first hash bin in the hash bin list in step 226 . 

Tlie rendering engine 92a determines if the end of tlie liash bin list lias been readied in step 228. 
Determining if the end of the hash bin list has been reached includes determining if the number of liash bins 
visited in hash bin list is greater than the hash bin list count value in the case of Figs. 10 and 1 1 . "Visiting" a 
hash bin means assigning the current liash bin reference to a new value, as is performed in steps 226 and 
236. Determining if the end of the hash bin list lias been reached includes detemiining if tlie current liash bin 
reference has a NULL value in the case of Fig. 1 2. 

If the end of the hash bin list has been readied, then tlie glyph for the Unicode character 
corresponding to the encoding value which was received in step 220 is not present in tlie hash table 160. In 
this case, in step 230 the rendering engine 92a searches the list of downloaded glyphs 97 (of Fig. 6) to find 
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the glyph for the Unicode character corresponding to the encoding value which was received in step 220. If 
the glyph is not present in the list of downloaded glyphs 97 then an error has occurred. 

If the end of the liash bin list lias not been readied, in step 232, the rendering engine 92a 
determines if the encoding value of the current hash bin is equal to tlx; encoding value wliich was received in 
step 220. If not the current hash bin reference is assigned a reference to the next hash bin in the liash bin list 
in step 236 and the rendering engine 92a returns to step 228. However, if the encoding values are equal, the 
rendering engine 92a returns a reference to the glyph of the current hash bin in step 234. 

Although the system and method of Hie present invention has been described in connection with the 
preferred embodiment, it is not intended to be limited to the specific form set forth herein, but on tlie 
contrary, it is intended to cover such alternatives, modifications, and equivalents, as can be reasonably 
included within the spirit and scope of the invention as defined by tlie appended claims. 
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WHAT IS CLAIMED IS: 

1 . A television system comprising: 

a set top box comprising: 

a receiver configured to receive an application program from a broadcast center, wherein 
said application program includes a first character for display on a television, 
wherein said first cliaracler has an associated encoding value from a set of 
encoding values, wherein said set of encoding values comprises encoding values 
for characters from a plurality of languages; 

a processor configured to operably receive said application program from said receiver 
and execute said application program; 

a language detector configured to determine a first language associated with said first 
character from said plurality of languages; and 

a first rendering engine associated with said first language for rendering said first 
20 character for display on the television. 

2. The system as recited in claim 1 , wherein said set top box further comprises a glyph describing a 
graphic representation of said first cliaracter, wherein said glyph corresponds to said encoding value of said 
first cliaracter, wherein said first rendering engine uses said glyph to render said first character 

25 

3 . Tlie system as recited in claim 1 , wherein said application program further comprises a glyph 
describing a graphic representation of said first cliaracter, wherein said glyph corresponds to said encoding 
value of said first cliaracter, wherein said first rendering engine uses said glyph to render said first character. 

30 4. The system as recited in claim 1 , wherein said application program includes a second character for 
display on the television, wlierein said second character has an associated encoding value from said set of 
encoding values. 

5. The system as recited in claim 4, wherein said language detector is configured to determine a 
3 5 second language associated with said second character from said plurality of languages. 

6. Tlie system as recited in claim 5, wherein said set top box further comprises a second rendering 
engine associated with said second language for rendering said second cliaracter for display on the television. 

21 
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7. The system as recited in claim 6, wherein said set top box further comprises a glyph describing a 
graphic representation of said second character, wherein said glyph corresponds to said encoding value of 
said second cliaracter, wherein said second rendering engine uses said glyph to render said second cliaracter. 

8. The system as recited in claim 5, wherein said first rendering engine is also associated with said 
second language for rendering said second cliaracter for display on the television. 

9. The system as recited in claim 1 , wherein said set top box further comprises a de-multiplexer 
coupled to said receiver. 

10. The system as recited in claim 9, wherein said receiver is configured to receive an audio/video data 
stream for display on the television concurrently with said first character and multiplexed with said 
application program, wlierein said de-multiplexer is configured to de-multiplex said audio/video data stream 
from said application program. 

1 1 . The system as recited in claim 1 , wherein said set top box further comprises a port for receiving 
user input from a user input device. 

12. The system as recited in claim 1 1 , wherein said set top box is configured to communicate said user 
20 input to a remote computer system. 

1 3 . The system as recited in claim 1 , wherein said set top box further comprises a memory coupled to 
said receiver configured to store said application program. 

25 14. The system as recited in claim 1, wherein said set top box further comprises an encoding engine 
configured to receive said first character from said application program, and to invoke said language detector 
and said first rendering engine. 



15 



15. The system as recited in claim 14, wherein said encoding engine comprises functions callable by 
30 said application program for manipulating strings of characters, wherein each of said characters lias an 

encoding value from said set of encoding values. 

16. The system as recited in claim 1, wherein said set of encoding values comprises the Unicode set of 
character encoding values. 

35 

17. The system as recited in claim 1 , wherein said set top box further comprises a memory coupled to 
said receiver configured to store a plurality of glyphs. 
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18. 



A method for displaying a string of characters in a television system, wherein the television system 
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30 



comprises a set top box comprising a receiver, a processor, a language detector, and a first rendering engine, 
the method comprising: 

receiving into said set top box an application program from a broadcast center, wherein said 

application program includes a first cliaractcr for display on a television, wherein said lirsl 
character lias an associated encoding value from a set of encoding values, wherein said set 
of encoding values comprises encoding values for characters from a plurality of 
languages; 

executing said application program on said processor; 

said language detector determining a first language associated with said first character from said 
plurality of languages; and 

said first rendering engine rendering said first character for display on the televisioa 

19. The method as recited in claim 1 8, wherein said set top box further comprises a glyph describing a 
graphic representation of said first character, wherein said glyph corresponds to said encoding value of said 
first character, wherein said rendering comprises using said glyph to render said first character. 

20. The method as recited in claim 1 8, wherein said application program further comprises a glyph 
describing a graphic representation of said first cliaracter, wlierein said glyph corresponds to said encoding 
value of said first character, wherein said rendering comprises using said glyph to render said first character. 

21 . The method as recited in claim 18, wlierein said application program includes a second character 
for display on the television, wherein said second character has an associated encoding value from said set of 
encoding values. 

22. The method as recited in claim 21, further comprising said language detector determining a second 
language associated with said second character from said plurality of languages. 

23. The method as recited in claim 22, wlierein said set top box further comprises a second rendering 
engine associated with said second language, wherein said method further comprises said second rendering 
engine rendering said second character for display on the televisioa 

24. The method as recited in claim 23, wherein said set top box further comprises a glyph describing a 
graphic representation of said second character, wherein said glyph corresponds to said encoding value of 
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said second cliaracter, wherein said second rendering engine rendering said second character comprises 
using said glyph. 



25 . The method as recited in claim 2 1 , further comprising said first rendering engine rendering said 
5 second character for display on the television. 

26. Hie metliod as recited in claim 1 8, wherein said receiver is configured to receive an audio/video 
data stream multiplexed with said application program wlierein said method further comprises de- 
multiplexing said audio/video data stream from said application program 

10 

27. Tlie metliod as recited in claim 1 8, further comprising receiving an audio/video data stream from 
the broadcast center, and displaying the audio/video data stream on the television concurrently with said first 
cliaracter. 

1 5 28. The method as recited in claim 1 8, further comprising receiving user input from a user input 
device. 

29. Tlie method as recited in claim 28 ? further comprising communicating said user input to a remote 
computer system 

30. The metliod as recited in claim 1 8, further comprising storing said application program in a 
memory of said set top box. 

3 1 . The metliod as recited in claim 18, wherein said set of encoding values comprises the Unicode set 
of cliaracter encoding values. 

32. The metliod as recited in claim 18, further comprising storing a plurality of glyphs in a memory of 
said set top box. 

33. A television system comprising: 

a broadcast center configured to transmit an application program and a glyph, wherein said 

application program includes a character for display on a television, wherein said glyph 
describes a graphic representation of said character, and 

a set top box, comprising: 

a receiver configured to receive said application program and said glyph from said 
broadcast center, 
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a processor configured to operably receive said application program from said receiver 
and execute said application program; and 

5 a rendering engine for rendering said cliaracter for display on the television, wherein said 

rendering engine uses said glyph lo render said character for display on the 
television. 

34. The system as recited in claim 33, wherein said glyph is included in a resource file received with 
10 said application program from said broadcast center. 

35. The system as recited in claim 33, wherein said set top box further comprises an operating 
environment configured lo add said glyph to a list of downloaded glyphs. 

15 36. The system as recited in claim 33, wherein said set top box further comprises a glyph set, wherein 
said rendering engine uses said glyph set to render characters. 

37. The system as recited in claim 36, wherein said rendering engine is configured to determine if a 
glyph for said character is not present in said glyph set 

20 

38. The system as recited in claim 37, wherein said rendering engine is configured to search said list of 
downloaded glyphs if said glyph for said character is not present in said glyph set. 

39. The system as recited in claim 33, wherein said set top box further comprises a memory operably 
25 coupled to said receiver configured to store said glyph. 

40. The system as recited in claim 39, wherein said memory is a random access memory. 

4 1 . The system as recited in claim 39, wherein said memory is a flash memory. 

30 

42. The system as recited in claim 33, wherein said set top box further comprises an encoding engine 
configured to receive said character from said application program, and to invoke said rendering engine. 

43. The system as recited in claim 33, wherein said character has an encoding value according to the 
3 5 Unicode set of character encoding values. 

44. A method for displaying a string of cliaracters in a television system, wherein the television system 
comprises a set top box comprising a processor and a memory, the method comprising: 
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transmitting an application program and a glyph, wherein said application program includes a 

character for display on a television, wlierein said glyph describes a graphic representation 
of said character; 

5 receiving said application program and said glyph; 

executing said application program; and 

rendering said character for display on the television using said glyph to render said character for 
1 0 display on the television. 

45. The method as recited in claim 44, wherein said transmitting said application program and said 
glyph comprises transmitting a resource file including said glyph. 

1 5 46. The method as recited in claim 44 wherein said receiving said application program and said glyph 
comprises receiving said resource file including said glyph. 

47. The method as recited in claim 44, furtlicr comprising adding said glyph to a list of downloaded 
glyphs. 

20 

48. The metlKKl as recited in claim 47, further comprising determining if a glyph for said character is 
not present in a glyph set of said set top box. 

49. The method as recited in claim 48, further comprising searching said list of downloaded glyplis if 
25 said glyph for said character is not present in said glyph set. 

50. The method as recited in claim 44, further comprising storing said glyph in the memory of said set 
top box. 

30 51. The method as recited in claim 44, wherein said character has an encoding value according to the 
Unicode set of character encoding values. 

52. A set top box for receiving from a broadcast center an application program and a glyph, wherein 
the application program includes a character for display on a television, wherein the glyph describes a 
35 graphic representation of the character, the set top box comprising: 

a receiver configured to receive said application program and said glyph from the broadcast center, 
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a processor configured to operably receive said application program from said receiver and execute 
said application program; and 



a rendering engine for rendering said cliaraclcr for display on the television, wherein said rendering 
5 engine uses said glyph to render said character for display on tlic television. 

53. The set top box as recited in claim 52, wherein said glyph is included in a resource file received 
with said application program from the broadcast center. 

10 54. The set lop box as recited in claim 52, wherein said set top box further comprises an operating 
environment configured to add said glyph to a list of downloaded glyphs. 

55. The set top box as recited in claim 52, further comprising a glyph set, wherein said rendering 
engine uses said glyph set to render characters. 

15 

56. The set top box as recited in claim 55, wherein said rendering engine is configured to determine if a 
glyph for said character is not present in said glyph set 

57. The set top box as recited in claim 56, wherein said rendering engine is configured to search said 
20 list of downloaded glyphs if said glyph for said character is not present in said glyph set. 

58. The set top box as recited in claim 52, further comprising a memory operably coupled to said 
receiver configured to store said glyph. 

25 59. The set top box as recited in claim 58, wherein said memory is a random access memory. 

60. The set top box as recited in claim 58, wherein said memory is a flash memory. 

6 1 . The set top box as recited in claim 52, further comprising an encoding engine configured to receive 
30 said character from said application program, and to invoke said rendering engine. 

62. The set top box as recited in claim 52, wherein said character has an encoding value according to 
the Unicode set of character encoding values. 

35 63. A method for efficiently storing a subset of cliaracters from a set of characters, wherein the set of 
characters are encoded according to a set of encoding values, wherein each of me characters has a unique 
associated encoding value, the method comprising: 
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receiving acharacter of said subset of characters, wherein said character comprises an encoding 
value and a glyph describing a graphic representation of said character: 

allocating storage for a hash bin comprising storage for an encoding value and a glyph; 

storing said encoding value and said glyph in said hash bin; 

calculating an index according to an equation 

index = ((((encoding_value & MASK1) » SHEFTVAL ) A encoding_value) & MASK2), 

wherein said MASK1 , MASK2, and SfflFTVAL are predefined constants; 

calculating a reference to a hash bin list by indexing into a hash table of references to hash bin lists 
using said index; 

adding said liash bin to said hash bin list; 

performing said receiving, said allocating, said storing, said calculating an index, said calculating a 
reference, and said adding for each cliaracter in said subset of characters. 

64. The method as recited in claim 63, wherein said set of encoding values comprises the Unicode set 
of encoding values. 

65. The method as recited in claim 63, wherein said subset of characters comprises characters from at 
2 5 least one language. 

66. The method as recited in claim 65, wherein said at least one language includes at least one from the 
group comprising: Japanese, Chinese, Korean, Thai, Arabic, Indie, and European languages. 

30 67. The method as recited in claim 63, wherein said MASK1 constant is OxOff, said MASK2 constant is 
0x07ff, and said SHEFTVAL constant is 8. 

68. The method as recited in claim 63, wherein said hash table of references comprises 2048 references 
to hash bin lists. 
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69. The method as recited in claim 63, wlierein said hash bin further comprises a reference to said 
glyph, wherein said storage for said bin comprises a first portion for said glyph and a second portion for said 
encoding value and said reference. 
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70. The method as recited in claim 63, wliercin said hash bin list comprises a linked list of hash bins. 

7 1 . The method as recited in claim 63, wherein said hash bin list comprises an array of hash bins. 

5 72. The method as recited in claim 70, wherein said reference to said hash bin list comprises a count of 
liash bins in said hash bin list. 

73. The method as recited in claim 63, further comprising allocating storage for said hash tabic prior to 
said allocating storage for a hash bin. 

10 

74. A method for quickly retrieving a character of a subset of characters from a set of characters, 
wherein tlie set of cliaracters are encoded according to a set of encoding values, wherein each of the 
characters lias a unique associated encoding value, the method comprising: 

1 5 receiving an encoding value of a character in said subset of cliaracters; 

calculating an index according to an equation, 

ndex = ((((encoding_value & MASK1) » SHEFTVAL ) A encoding_value) & MASK2), 

20 wherein said MASK1 , MASK2, and SHIFTVAL are predefined constants; 

calculating a reference to a hash bin list by indexing into a hash table of references to liash bin lists 
using said index, wherein a liash bin includes an encoding value and a glyph describing a 
grapliic representation of a cliaracter, 

25 

searching said hash bin list for a hash bin having an encoding value equal to said encoding value of 
said cliaracter, and 

returning a glyph of said hash bin having an encoding value equal to said encoding value of said 
30 cliaracter. 

75. The method as recited in claim 74, wherein said searching comprises: 
referencing a first hash bin of said hash bin list as a current hash bin; 

35 

determining if an encoding value of said current liash bin is equal to said encoding value of said 
character, 
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assigning said current hash bin to be a next liash bin in said list of hash bins if said encoding value 
of said current hash bin is not equal to said encoding value of said character, and 

performing said determining and said assigning until said encoding value of said current hash bin is 
5 equal to said encoding value of said character. 

76. The method as recited in claim 74, wherein said set of encoding values comprises the Unicode set 
of encoding values. 

10 77. The method as recited in claim 74, wherein said subset of characters comprises characters from at 
least one language. 

78. Tlie method as recited in claim 77, wherein said at least one language includes at least one from Uic 
group comprising: Japanese, Cliinese, Korean, Thai, Arabic, Indie, and European languages. 

15 

79. The method as recited in claim 74, wherein said MASK1 constant is OxOff, said MASK2 constant is 
0x07fT, and said SHIFTVAL constant is 8. 

80. The method as recited in claim 74, wherein said liash table of references comprises 2048 references 
20 to liash bin lists. 

8 1 . The method as recited in claim 74, wherein said liash bin further comprises a reference to said 
glyph. 

25 82. The method as recited in claim 74, wherein said hash bin list comprises a linked list of hash bins. 

83 . The method as recited in claim 74, wherein said liash bin list comprises an array of liash bins. 

84. The method as recited in claim 83, wherein said reference to said hash bin list comprises a count of 
30 hash bins in said hash bin list. 

85. A set top box for displaying a character on a television, wherein the character is of a subset of 
characters from a set of characters, wherein the set of characters are encoded according to a set of encoding 
values, wherein each of the characters has a unique associated encoding value, comprising: 



35 



a processor configured to execute program instructions; and 

a memory coupled to said processor configured to store said program instructions, wherein said 
program instructions comprise: 

30 
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instructions for receiving an encoding value of a character in said subset of cliaraclers; 
instructions for calculating an index according to an equation, 

index = ((((cncoding_valuc & MASK1 ) » SHIFTVAL ) A encoding^vaJuc) & MASK2), 

5 wherein said MASK1 , MASK2, and SHfFTVAL arc predefined constants; 

instructions for calculating a reference to a hash bin list by indexing into a hash table of 
references to hash bin lists using said index, wherein a hash bin comprises 
storage for an encoding value and a glyph describing a graphic representation of 
10 said character; 

instructions for searching said liash bin list for a hash bin having an encoding value equal 
to said encoding value of said cliaracter; 

1 5 instructions for returning a glyph of said hash bin liaving an encoding value equal to said 

encoding value of said character, and 

instructions for rendering said glyph for display on the television 

20 86. The set top box as recited in claim 85, wlicrein said set of encoding values comprises the Unicode 
set of encoding values. 

87. The set top box as recited in claim 85, wherein said subset of characters comprises characters from 
at least one language. 

25 

88. The set lop box as recited in claim 87, wherein said at least one language includes at least one from 
the group comprising: Japanese, Chinese, Korean. 

89. The set top box as recited in claim 85, wherein said MASK1 constant is OxOff, said MASK2 
30 constant is 0x07ff, and said SHIFTVAL constant is 8. 

90. The set top box as recited in claim 85, wherein said hash table of references comprises 2048 
references to hash bin lists. 

35 91 . The set top box as recited in claim 85, further comprising instructions for allocating storage for said 
liash table prior to said allocating storage for a hash bin. 

92. A set top box for displaying a character on a television, wherein the cliaracter is of a subset of 
characters from a set of cliaraclers, wherein the set of characters are encoded according to a set of encoding 
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values, wherein each of the characters has a unique associated encoding value, comprising: 

a processor configured to execute program instructions; and 

5 a memory coupled to said processor configured to store said program instructions, wliercin said 

program instructions comprise: 

instructions for receiving an encoding value of a character in said subset of characters; 

1 0 instructions for calculating an index according to an equation, 

index = ((((encodingvalue & MASK1 ) » SHIFTVAL ) A encoding^value) & MASK2); 

wherein said MASK1, MASK2, and SHIFTVAL are predefined constants; 

1 5 instructions for calculating a reference to a hash bin list by indexing into a hash table of 

references to hash bin lists using said index, wherein a hash bin includes an 
encoding value and a glyph describing a graphic representation of said character 

instructions for searching said hash bin list for a hash bin having an encoding value equal 
20 to said encoding value of said character, and 

instructions for returning a glyph of said hash bin having an encoding value equal to said 
encoding value of said character 

25 93 . The set top box as recited in claim 92, wherein said instructions for searching comprise: 

instructions for referencing a first hash bin of said hash bin list as a current hash bin; 

instructions for determining if an encoding value of said current hash bin is equal to said encoding 
30 value of said character, 

instructions for assigning said current hash bin to be a next hash bin in said list of hash bins if said 
encoding value of said current hash bin is not equal to said encoding value of said 
character, and 



35 



instructions for performing said determining and said assigning until said encoding value of said 
current hash bin is equal to said encoding value of said character 
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94. The set top box as recited in claim 92, wherein said set of encoding values comprises the Unicode 
set of encoding values. 

95. The set top box as recited in claim 92, wherein said subset of characters comprises cliaractcrs from 
5 at least one language. 



the group comprising: Japanese, Chinese, Korean. 

10 97. The set top box as recited in claim 92, wlierein said MASK1 constanl is OxOff, said MASK2 
constant is 0x()7fT, and said SHIFTVAL constant is 8. 

98. The set top box as recited in claim 92, wherein said hash table of references comprises 2048 
references to hash bin lists. 

15 



96. 



The set top box as recited in claim 95, wherein said at least one language includes at least one from 
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